Exemplar Driven Character Recognition in the Wild

Authors

  • Karthik Sheshadri
  • Santosh Kumar Divvala
Abstract

Character recognition in natural scenes remains a formidable challenge in computer vision. Traditional optical character recognition (OCR) methods perform poorly on characters from scene text owing to background clutter, the difficulty of binarisation, and arbitrary skew. Further, English characters group into only 62 classes (26 uppercase letters, 26 lowercase letters, and 10 digits), whereas many of the world's languages have several hundred classes. In particular, most Indic script languages such as Kannada exhibit large intra-class diversity, while two distinct classes may differ only in a minor contour above or below the character. These considerations motivate an exemplar approach to classification: one that does not seek intra-class commonality among extreme examples, which are essentially sub-classes of their own.

Exemplar SVMs were recently introduced in the object recognition context. The essence of the exemplar approach is that, rather than seeking to establish commonality within classes, a separate classifier is learnt for each exemplar in the dataset. To keep each individual classifier simple, linear SVMs are used, so each classifier is an exemplar-specific weight vector. Each exemplar in the dataset is resized to standard dimensions, and HOG features are then densely extracted to create a rigid template x_E. A set of negative samples N_E is created by the same process from classes not corresponding to the exemplar. Each classifier (w_E, b_E) maximizes the separation between x_E and every window in N_E. This is equivalent to optimizing a convex objective [4].
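The objective itself does not appear in this excerpt. A form commonly given in the exemplar-SVM literature is Ω_E(w, b) = ||w||² + C1·h(wᵀx_E + b) + C2·Σ_{x ∈ N_E} h(−wᵀx − b), with hinge loss h(z) = max(0, 1 − z); whether [4] states exactly this form is an assumption here. The following is a minimal sketch, not the authors' implementation: it minimises that objective by plain subgradient descent using NumPy only. The function names, the values of c1, c2, lr and steps, and the random vectors standing in for HOG templates are all hypothetical.

```python
import numpy as np


def hinge(z):
    """Hinge loss h(z) = max(0, 1 - z), applied elementwise."""
    return np.maximum(0.0, 1.0 - z)


def exemplar_objective(w, b, x_e, negatives, c1, c2):
    """||w||^2 + c1*h(w.x_e + b) + c2 * sum over negatives of h(-(w.x + b))."""
    positive_term = c1 * hinge(w @ x_e + b)
    negative_term = c2 * hinge(-(negatives @ w + b)).sum()
    return float(w @ w + positive_term + negative_term)


def train_exemplar_svm(x_e, negatives, c1=0.5, c2=0.01, lr=0.01, steps=500):
    """Subgradient descent; returns the lowest-objective (w, b) visited.

    Assumed (hypothetical) hyperparameters; c1 > c2 weights the single
    positive exemplar more heavily than each negative window.
    """
    w = np.zeros(x_e.shape[0])
    b = 0.0
    best = (exemplar_objective(w, b, x_e, negatives, c1, c2), w.copy(), b)
    for _ in range(steps):
        grad_w = 2.0 * w  # gradient of the regulariser ||w||^2
        grad_b = 0.0
        if w @ x_e + b < 1.0:  # exemplar violates its margin
            grad_w = grad_w - c1 * x_e
            grad_b -= c1
        violating = (negatives @ w + b) > -1.0  # negatives inside the margin
        grad_w = grad_w + c2 * negatives[violating].sum(axis=0)
        grad_b += c2 * violating.sum()
        w = w - lr * grad_w
        b = b - lr * grad_b
        value = exemplar_objective(w, b, x_e, negatives, c1, c2)
        if value < best[0]:  # subgradient steps are not monotone, so keep the best
            best = (value, w.copy(), b)
    return best[1], best[2]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x_e = rng.normal(size=36) + 1.0         # stand-in for one exemplar's HOG template
    negatives = rng.normal(size=(200, 36))  # stand-ins for windows from other classes
    w_e, b_e = train_exemplar_svm(x_e, negatives)
    print("exemplar score:", w_e @ x_e + b_e)
    print("highest negative score:", (negatives @ w_e + b_e).max())
```

In a full pipeline one such (w_E, b_E) pair would be trained per exemplar, and a test window would be scored against all of them. How the per-exemplar scores are combined into a class decision is not described in this excerpt.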

Similar resources

Exemplar Driven Character Recognition in the Wild

Character recognition in natural scenes continues to represent a formidable challenge in computer vision. Beyond variation in font, there exist difficulties in occlusion, background clutter, binarisation, and arbitrary skew. Recent advances have leveraged state of the art methods from generic object recognition to address some of these challenges. In this paper, we extend the focus to Indic scr...

Neural Network Based Recognition System Integrating Feature Extraction and Classification for English Handwritten

Handwriting recognition has been one of the active and challenging research areas in the field of image processing and pattern recognition. It has numerous applications, including reading aids for the blind, bank cheque processing, and conversion of handwritten documents into structured text. Neural Network (NN) with its inherent learning ability offers promising solutions for handwritten characte...

Exemplar-based Action Recognition in Video

Over recent years, a lot of progress has been made towards automatic annotation of video material, especially in the context of object and scene recognition. However, in comparison, action recognition is still in its infancy. Whereas originally silhouette-based approaches or approaches based on pose estimation have been studied mostly, good results have been reported recently using extensions o...

Devanagari Character Recognition in the Wild

This paper examines the issues in recognizing Devanagari characters in the wild, such as on sign boards, advertisements, logos, shop names, notices, and address posts. While some works deal with the issues in recognizing machine-printed and handwritten Devanagari characters, it is not clear if such techniques can be directly applied to Devanagari characters captured in the wild. Moreo...

Exemplar based approaches on Face Fiducial Detection and Frontalization

Computer vision solutions such as face detection and recognition, facial reenactment, facial expression analysis and gender detection have seen fruitful applications in various domains such as security, surveillance, social media and animation. Many of the above solutions have common pre-processing steps such as fiducial detection, appearance modeling, and face structural modeling. These steps...

Publication date: 2012